
175 research outputs found

Translocation and deletion breakpoints in cancer genomes are associated with potential non-B DNA-forming sequences

Author: Bacolla Albino
Cooper David Neil
Tainer John A.
Vasquez Karen M.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 15/04/2016

Gross chromosomal rearrangements (including translocations, deletions, insertions and duplications) are a hallmark of cancer genomes and often create oncogenic fusion genes. An obligate step in the generation of such gross rearrangements is the formation of DNA double-strand breaks (DSBs). Since the genomic distribution of rearrangement breakpoints is non-random, intrinsic cellular factors may predispose certain genomic regions to breakage. Notably, certain DNA sequences with the potential to fold into secondary structures [potential non-B DNA structures (PONDS); e.g. triplexes, quadruplexes, hairpin/cruciforms, Z-DNA and single-stranded looped-out structures with implications in DNA replication and transcription] can stimulate the formation of DNA DSBs. Here, we tested the postulate that these DNA sequences might be found at, or in close proximity to, rearrangement breakpoints. By analyzing the distribution of PONDS-forming sequences within ±500 bases of 19 947 translocation and 46 365 sequence-characterized deletion breakpoints in cancer genomes, we find significant association between PONDS-forming repeats and cancer breakpoints. Specifically, (AT)n, (GAA)n and (GAAA)n constitute the most frequent repeats at translocation breakpoints, whereas A-tracts occur preferentially at deletion breakpoints. Translocation breakpoints near PONDS-forming repeats also recur in different individuals and patient tumor samples. Hence, PONDS-forming sequences represent an intrinsic risk factor for genomic rearrangements in cancer genomes

Online Research @ Cardiff

PubMed Central


Using Supercomputing Resources in Genomic Research

Author: Ahmed Zamal
Bacolla Albino
De-Paula Ruth B
Moiani Davide
Tainer John A
Tsai Chi-Lin
Ye Zu
Publication date: 01/01/2021

TACC resources have proven to be critical and enabling to mine cancer genomic data, genomic variants associated with human disease and polymorphic human traits, addressing biological questions otherwise non-approachable by conventional experiments. We have developed computational scripts that we use in a parallel environment to harness the capabilities of TACC HPCs, and which we have made publicly available on GitHub. In selected peer-reviewed publications acknowledging TACC support, we have reported the association of DNA sequences able to form alternative DNA structures (or non-B DNA) with sites of chromosomal breaks leading to gross chromosomal translocations in cancer genomes, with sites of gene duplication predisposing to Parkinson’s disease, and most recently with regions of increased polymorphism in the human population. We found an exquisite correlation between the expression of selected genes and the mutational burden in cancer patients. While solving the crystal structure of a poorly characterized exonuclease, named EXO5, TACC resources enabled the assignment of a role for EXO5 in the cellular response to DNA damage, a vital pathway used by tumors to survive and grow, along with key genes whose high expression is linked to poor survival in cancer patients. Most recently, during the discovery of a nuclear role for GRB2, an adaptor protein previously thought to act only in the cytoplasm, TACC resources enabled us to test hypotheses derived from laboratory data. We were gratified to confirm the laboratory prediction that high expression of GRB2, together with its binding partner the MRE11 nuclease, carries accurate prognostic power for poor patient survival in breast cancer patients proficient in DNA homology-directed repair.
These composite findings, significantly facilitated by TACC resources, have been critical to further our understanding in biological processes relevant to human disease, and to provide knowledge for the development of more precise therapeutic tools aimed at improving human health

Texas ScholarWorks

Detection and characterization of local inverted repeats regularities

Author: A Bacolla
AH Tavares
G Benson
H Inagaki
J Kolb
R. Z. Cer
W Kent
Y Du
Y Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2021

To explore regularities in inverted repeats along genome sequences, we propose a sliding window method that extracts concentration scores of periodic regularities in inverted repeats and the total mass of possible inverted repeat pairs. We apply the method to the human genome and locate regions with the potential to form a large number of hairpin/cruciform structures. The number of windows found with periodic regularities is small, and the patterns of occurrence are chromosome-specific.
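The abstract above names a sliding window approach without giving its scoring formulas. As a rough illustration only, the sketch below counts inverted repeat pairs (an arm followed, after a short spacer, by its reverse complement) in each window. The function names, the arm length, the spacer limit, and the use of a plain count in place of the paper's "concentration score" and "total mass" are all assumptions made here, not the authors' definitions.

```python
# Illustrative sketch: a simple pair count per window, NOT the scoring used in the paper.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def count_inverted_repeat_pairs(window: str, arm: int = 4, max_spacer: int = 10) -> int:
    """Count positions where an arm is followed, after 0..max_spacer bases,
    by its reverse complement (a potential hairpin/cruciform stem)."""
    count = 0
    n = len(window)
    for i in range(n - 2 * arm + 1):
        left = window[i:i + arm]
        if any(base not in COMPLEMENT for base in left):
            continue  # skip arms containing N or other ambiguity codes
        target = reverse_complement(left)
        for spacer in range(max_spacer + 1):
            j = i + arm + spacer
            if j + arm > n:
                break
            if window[j:j + arm] == target:
                count += 1
    return count


def sliding_window_scores(seq: str, window: int = 50, step: int = 10,
                          arm: int = 4, max_spacer: int = 10) -> list[tuple[int, int]]:
    """Return (window start, inverted repeat pair count) for each window."""
    seq = seq.upper()
    return [
        (start, count_inverted_repeat_pairs(seq[start:start + window], arm, max_spacer))
        for start in range(0, len(seq) - window + 1, step)
    ]
```

For example, `sliding_window_scores(genome_fragment, window=1000, step=100)` would give one count per 1000-base window; windows with unusually high counts are candidates for closer inspection under these assumed parameters.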

Crossref

Repositório Institucional da Universidade de Aveiro

Local DNA dynamics shape mutational patterns of mononucleotide repeats in human genomes

Author: Bacolla A.
Chen H.
Cooper David Neil
Howells Katy
Vasquez K. M.
Zhu X.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015

Single base substitutions (SBSs) and insertions/deletions are critical for generating population diversity and can lead both to inherited disease and cancer. Whereas on a genome-wide scale SBSs are influenced by cellular factors, on a fine scale SBSs are influenced by the local DNA sequence-context, although the role of flanking sequence is often unclear. Herein, we used bioinformatics, molecular dynamics and hybrid quantum mechanics/molecular mechanics to analyze sequence context-dependent mutagenesis at mononucleotide repeats (A-tracts and G-tracts) in human population variation and in cancer genomes. SBSs and insertions/deletions occur predominantly at the first and last base-pairs of A-tracts, whereas they are concentrated at the second and third base-pairs in G-tracts. These positions correspond to the most flexible sites along A-tracts, and to sites where a ‘hole’, generated by the loss of an electron through oxidation, is most likely to be localized in G-tracts. For A-tracts, most SBSs occur in the direction of the base-pair flanking the tracts. We conclude that intrinsic features of local DNA structure, i.e. base-pair flexibility and charge transfer, render specific nucleotides along mononucleotide runs susceptible to base modification, which then yields mutations. Thus, local DNA dynamics contributes to phenotypic variation and disease in the human population

CiteSeerX

Online Research @ Cardiff

PubMed Central

Genome-Wide Analyses of Recombination Prone Regions Predict Role of DNA Structural Motif in Recombination

Author: A Bacolla
A Siddiqui-Jain
AJ Jeffreys
AK Todd
AM Zahler
AT Phan
C Wyman
D Sen
Darren P. Martin
DC Crawford
DE Gilbert
DJ Patel
DS Chekmenev
DT Kirkpatrick
EH Blackburn
G Ghosal
H Arthanari
I Sandovici
IR Leith
J Zhang
JL Huppert
JL Workman
JT Davis
K Halder
K Muniyappa
K Paeschke
L Kauppi
L Ying
LA Hanakahi
M Gellert
M Modesti
MB Gerstein
MN Weitzmann
N Kon
N Maizels
P Balagurumoorthy
P Fojtik
P Qiu
P Rawal
PJ Sabo
Prithvi Mani
R Giraldo
R Shenkar
RD Wells
RR Sinden
S Anuradha
S Burge
S Myers
SA McManus
SC Raghavan
Shantanu Chowdhury
SM Mirkin
Swapan Kumar Das
TD Petes
Vinod Kumar Yadav
VK Yadav
W Dunnick
W Winckler
Y Qin
Y Zhao
Z Du
Publication venue: Public Library of Science
Publication date: 02/02/2009

HapMap findings reveal surprisingly asymmetric distribution of recombinogenic regions. Short recombinogenic regions (hotspots) are interspersed between large relatively non-recombinogenic regions. This raises the interesting possibility of DNA sequence and/or other cis-elements as determinants of recombination. We hypothesized the involvement of non-canonical sequences that can result in local non-B DNA structures and tested this using the G-quadruplex DNA as a model. G-quadruplex or G4 DNA is a unique form of four-stranded non-B DNA structure that engages certain G-rich sequences; the presence of such motifs has been noted within telomeres. In support of this hypothesis, genome-wide computational analyses presented here reveal enrichment of potential G4 (PG4) DNA forming sequences within 25618 human hotspots relative to 9290 coldspots (p < 0.0001). Furthermore, co-occurrence of PG4 DNA within several short sequence elements that are associated with recombinogenic regions was found to be significantly more than randomly expected. Interestingly, analyses of more than 50 DNA binding factors revealed that co-occurrence of PG4 DNA with target DNA binding sites of transcription factors c-Rel, NF-kappa B (p50 and p65) and Evi-1 was significantly enriched in recombination-prone regions. These observations support involvement of G4 DNA in recombination, predicting a functional model that is consistent with duplex-strand separation induced by formation of G4 motifs in supercoiled DNA and/or when assisted by other cellular factors
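The abstract above does not state the motif definition used to call PG4 sequences. A pattern commonly used for this purpose is four runs of at least three guanines separated by loops of 1 to 7 bases; the sketch below assumes that pattern, scans only the strand it is given, and compares the fraction of regions containing a motif between two region sets. The function names and the example sequences are hypothetical, and no statistical test is performed.

```python
import re

# Assumed motif: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+ (a commonly used PG4 pattern;
# the abstract does not confirm that this exact definition was applied).
PG4_PATTERN = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")


def find_pg4(seq: str) -> list[tuple[int, int]]:
    """Return (start, end) spans of non-overlapping PG4 matches on the given strand.
    The complementary strand (C-rich motifs) is not scanned in this sketch."""
    return [(m.start(), m.end()) for m in PG4_PATTERN.finditer(seq.upper())]


def pg4_fraction(regions: list[str]) -> float:
    """Fraction of regions that contain at least one PG4 match."""
    if not regions:
        return 0.0
    hits = sum(1 for region in regions if find_pg4(region))
    return hits / len(regions)


if __name__ == "__main__":
    # Hypothetical toy sequences standing in for hotspot and coldspot regions.
    hotspots = ["TTGGGAGGGTGGGAGGGTT", "ACGTGGGTTGGGACGGGTAGGGA"]
    coldspots = ["ATATATATATAT", "TTGGGAGGGTGGGAGGGTT"]
    print(pg4_fraction(hotspots), pg4_fraction(coldspots))
```

A real enrichment analysis would also scan the reverse strand, control for region length and GC content, and apply a significance test; those steps are left out here to keep the sketch short.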

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Distinct sequence features underlie microdeletions and gross deletions in the human genome

Author: Bacolla Albino
Ball Edward V.
Cooper David N.
Kehrer-Sawatzki Hildegard
Qi Mengling
Stenson Peter D.
Tainer John A.
Zhao Huiying
Publication venue: 'Wiley'
Publication date: 01/03/2022

Microdeletions and gross deletions are important causes (~20%) of human inherited disease and their genomic locations are strongly influenced by the local DNA sequence environment. This notwithstanding, no study has systematically examined their underlying generative mechanisms. Here, we obtained 42,098 pathogenic microdeletions and gross deletions from the Human Gene Mutation Database (HGMD) that together form a continuum of germline deletions ranging in size from 1 bp to 28,394,429 bp. We analyzed the DNA sequence within 1 kb of the breakpoint junctions and found that the frequencies of non-B DNA-forming repeats, GC-content, and the presence of seven of 78 specific sequence motifs in the vicinity of pathogenic deletions correlated with deletion length for deletions of length ≤30 bp. Further, we found that the presence of DR, GQ and STR repeats is important for the formation of longer deletions (>30 bp) but not for the formation of shorter deletions (≤30 bp) whilst significantly (Chi-square test P-value […] 30 bp). We provide evidence to support a functional distinction between microdeletions and gross deletions. Finally, we propose that a deletion length cut-off of 25-30 bp may serve as an objective means to functionally distinguish microdeletions from gross deletions

Online Research @ Cardiff

PubMed Central

The Role of Methylation in the Intrinsic Dynamics of B- and Z-DNA

Author: A Bacolla
A Herbert
A Perez
AF Rubin
AK Maunakea
Albino Bacolla
AR Brice
AT Phan
B Hartmann
B Heddi
BJ Killian
Brian T. Luke
C Rauch
Claudine Mayer
D Djuranovic
D Kim
DA Case
DL Beveridge
DN Cooper
Duncan E. Donohue
EF Pettersen
G Zheng
HK Srivastava
J Lee
J Srinivasan
J-P Ryckaert
Jack R. Collins
JL Crawford
JR Bothe
K Hornik
LJ Peck
M Behe
M Kulis
M Lee
M Orozco
MA Young
Nuri A. Temiz
NV Prabhu
P Varnai
P Várnai
PA Kollman
R Lavery
RD Wells
RP Sharma
RS Illingworth
S Bae
S Feng
SB Dixit
SC Ha
SM Mirkin
SY Ponomarev
T Darden
TE Cheatham
TE Cheatham 3rd
WL Jorgensen
Publication venue: Public Library of Science
Publication date: 01/01/2012

Methylation of cytosine at the 5-carbon position (5mC) is observed in both prokaryotes and eukaryotes. In humans, DNA methylation at CpG sites plays an important role in gene regulation and has been implicated in development, gene silencing, and cancer. In addition, the CpG dinucleotide is a known hot spot for pathologic mutations genome-wide. CpG tracts may adopt left-handed Z-DNA conformations, which have also been implicated in gene regulation and genomic instability. Methylation facilitates this B-Z transition but the underlying mechanism remains unclear. Herein, four structural models of the dinucleotide d(GC)5 repeat sequence in B-, methylated B-, Z-, and methylated Z-DNA forms were constructed and an aggregate 100 nanoseconds of molecular dynamics simulations in explicit solvent under physiological conditions was performed for each model. Both unmethylated and methylated B-DNA were found to be more flexible than Z-DNA. However, methylation significantly destabilized the BII, relative to the BI, state through the Gp5mC steps. In addition, methylation decreased the free energy difference between B- and Z-DNA. Comparisons of α/γ backbone torsional angles showed that torsional states changed marginally upon methylation for both B-DNA and Z-DNA. Methylation-induced conformational changes and lower energy differences may contribute to the transition to Z-DNA by methylated, over unmethylated, B-DNA and may be a contributing factor to biological function

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

DNA models of trinucleotide frameshift deletions: the formation of loops and bulges at the primer–template junction

Author: Bacolla
Bebenek
Beese
Benjamin C. Ponedel
Blanca
Cantor
Datta
Davis Jose
Garcia-Diaz
Hardman
Hare
Hsieh
Johnson
Kobayashi
Kunkel
Neil P. Johnson
Peter H. von Hippel
Rosen
SantaLucia
Solie
Streisinger
Tippin
Walter A. Baase
Zuker
Publication venue: Oxford University Press

Although mechanisms of single-nucleotide residue deletion have been investigated, processes involved in the loss of longer nucleotide sequences during DNA replication are poorly understood. Previous reports have shown that in vitro replication of a 3′-TGC TGC template sequence can result in the deletion of one 3′-TGC. We have used low-energy circular dichroism (CD) and fluorescence spectroscopy to investigate the conformations and stabilities of DNA models of the replication intermediates that may be implicated in this frameshift. Pyrrolocytosine or 2-aminopurine residues, site-specifically substituted for cytosine or adenine in the vicinity of extruded base sequences, were used as spectroscopic probes to examine local DNA conformations. An equilibrium mixture of four hybridization conformations was observed when template bases looped-out as a bulge, i.e. a structure flanked on both sides by duplex DNA. In contrast, a single-loop structure with an unusual unstacked DNA conformation at its downstream edge was observed when the extruded bases were positioned at the primer–template junction, showing that misalignments can be modified by neighboring DNA secondary structure. These results must be taken into account in considering the genetic and biochemical mechanisms of frameshift mutagenesis in polymerase-driven DNA replication

Crossref

PubMed Central

Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes

Author: A. Bacolla
Akgun
B. T. Luke
Chandrasekhar
Collier
Courey
D'Angelo
Dayn
Glickman
Gordenin
Hill
Ho
J. R. Collins
K. H. Bruce
Kamenetskii
Kouzine
Krasilnikov
Kurahashi
Kuroda-Kawaguchi
Lange
Leonard
M. Yi
Mirkin
N. Volfovsky
R. M. Stephens
R. Z. Cer
Rich
Ristic
Rohs
Schroth
Sheridan
Singleton
Stajich
Stein
U. S. Mudunuri
Wang
Wells
Wittig
Zhang
Zhao
Publication venue: Oxford University Press
Publication date: 01/01/2011

Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov

CiteSeerX

Crossref

PubMed Central

Controlled Chaos of Polymorphic Mucins in a Metazoan Parasite (Schistosoma mansoni) Interacting with Its Invertebrate Host (Biomphalaria glabrata)

Author: A Bacolla
A Loukas
A Rajkovic
A Theron
AS Edge
B Gryseels
BB Finlay
Benjamin Gourbal
BL Hibner
C Caudevilla
C Roth
C Vitte
CA Buscaglia
CA Reynaud
Christoph Grunau
CM Adema
CY Hayashi
CY Hung
Céline Cosseau
D Ebert
D Liao
E Roger
Emmanuel Roger
F Paques
F Tajima
FL Watson
G Cheng
G Ferbeyre
G Mitta
G Theodoropoulos
G Wang
GG Doxiadis
Guillaume Mitta
H Hirai
H Johannesson
H Puchta
Hirohisa Hirai
Italo M. Cesari
JE Taylor
JJ Lopez-Rubio
JM Di Noia
JP Cannon
JW Szostak
K Julenius
K Kapp
KA Brayton
KJ Hertel
L Van Valen
M De la Pena
M Martick
M Navarro
M Nei
M Suyama
MC Le Paslier
MD Canny
MS Marin
Paul J. Brindley
R Medzhitov
R Przybilski
R Rigatti
Raymond J. Pierce
RE Davis
Richard Galinier
RJ Dixon
RM Linzmeier
Rémi Emans
S Tonegawa
S Verjovski-Almeida
SA Frantz
SA Kyes
SA Ralph
SJ Hicks
SM Zhang
WF Patton
Y Dong
Z Pancer
Publication venue: Public Library of Science
Publication date: 01/01/2008

Invertebrates were long thought to possess only a simple, effective and hence non-adaptive defence system against microbial and parasitic attacks. However, recent studies have shown that invertebrate immunity also relies on immune receptors that diversify (e.g. in echinoderms, insects and molluscs (Biomphalaria glabrata)). Apparently, individual or population-based polymorphism-generating mechanisms exist that permit the survival of invertebrate species exposed to parasites. Consequently, the generally accepted arms race hypothesis predicts that molecular diversity and polymorphism also exist in parasites of invertebrates. We investigated the diversity and polymorphism of parasite molecules (Schistosoma mansoni Polymorphic Mucins, SmPoMucs) that are key factors for the compatibility of schistosomes interacting with their host, the mollusc Biomphalaria glabrata. We have elucidated the complex cascade of mechanisms acting both at the genomic level and during expression that confer polymorphism to SmPoMuc. We show that SmPoMuc is coded by a multi-gene family whose members frequently recombine. We show that these genes are transcribed in an individual-specific manner, and that for each gene, multiple splice variants exist. Finally, we reveal the impact of this polymorphism on the SmPoMuc glycosylation status. Our data support the view that S. mansoni has evolved a complex hierarchical system that efficiently generates a high degree of polymorphism, a “controlled chaos”, based on a relatively low number of genes. This contrasts with protozoan parasites that generate antigenic variation from large sets of genes such as Trypanosoma cruzi, Trypanosoma brucei and Plasmodium falciparum. Our data support the view that the interactions between parasites and their invertebrate hosts are far more complex than previously thought.
While most studies in this matter have focused on invertebrate host diversification, we clearly show that diversifying mechanisms also exist on the parasite side of the interaction. Our findings shed new light on how and why invertebrate immunity develops

Public Library of Science (PLOS)

Crossref

HAL-Inserm

Directory of Open Access Journals

PubMed Central

HAL Descartes

DI-fusion